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Abstract 

A Boolean function of n bits is balanced if it takes the value 1 with probability 
1/2. We exhibit a balanced Boolean function with a randomized evaluation procedure 
(with probability of making a mistake) so that on uniformly random inputs, no input 
bit is read with probability more than 0(n _1 / 2 -^/log n). We give a balanced monotone 
Boolean function for which the corresponding probability is 0(n -1 / 3 logn). We then 
show that for any randomized algorithm for evaluating a balanced Boolean function, 
when the input bits are uniformly random, there is some input bit that is read with 
probability at least 0(n -1 / 2 ). For balanced monotone Boolean functions, there is some 
input bit that is read with probability at least 0(n -1 / 3 ). 

1 Results 

Suppose that a randomized algorithm evaluates a Boolean function /onn input bits. When 
the input bits are uniformly random, let 5i = Pr[bit i gets read], where the probability is 
over the randomness of the input as well as the internal randomness of the algorithm. Let 
5 = maxj 5i be the maximum probability that a particular bit is read. How small can S be for 
a balanced Boolean function (one that takes on the values and 1 equally often)? What if 
the Boolean function is monotone (i.e., f(x) > f(y) whenever X{ > yi for alH = 1,2,..., n)? 

An obvious lower bound for 5i is h(f), the influence of the bit i. Recall that the influence 
h = Ii(f) of the i th input bit on / is denned to be the probability, for a uniformly random 
input, that changing the bit changes the value of the function. Many readers are familiar 
with the majority function and the fact that the influence of each bit is small, only 0(n -1 ' 2 ). 
But on most inputs the numbers of O's and l's are nearly balanced, and any algorithm that 
realiably evaluates majority will typically read 0(n) of the input bits, so that 5 = 6(1). 

There are a couple of other balanced Boolean functions that some readers may think 
of. The tribes function on n = m 2 m input bits is defined by partitioning the set of bits 
into "tribes" of size m, and the function takes the value 1 if and only if there is at least 
one tribe where all the bits are 1. The influence of each bit on the tribes function is only 
0(logra/n) [BOL89J (which is as small as maxj/j(/) can be KKL88]). but any algorithm 
that reliably evaluates the tribes function will typically read 0(n/logn) of the input bits, 
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so that 5 = 6(1/ log n). For the dictatorship function, where one bit determines the output, 
very few bits need to be read, but the dictator bit needs to be read, so 5 = 1. 

The tribes function and the majority are two examples of symmetric functions; they are 
invariant under a group acting transitively on the input bits. Many other boolean functions 
of interest are symmetric. Examples include recursive majority and percolation crossings in 
a torus. For symmetric functions there is an algorithm A computing / that reads on average 
m bits if and only if there is an algorithm A 1 computing / with 5 < m/n. (To go from A 
to A', one just permutes the bits by a uniformly-random element from the automorphism 
group before applying A. The other direction is obvious, since the expected number of bits 
that A' reads is at most 5n.) Thus, for symmetric functions, estimates on 5 are equivalent 
to estimates on the expected number of bits that need to be read. 

In the next section we give some nearly optimal constructions of Boolean functions that 
may be evaluated so that every input bit is unlikely to be read: 

Theorem 1. 1. There is a balanced Boolean function with an algorithm that always cor- 
rectly evaluates it on any input and for which 5 = ®{n~ l l 2 ^\ogn) . 

2. There is a balanced Boolean function with an algorithm that correctly evaluates it on 
most inputs most of the time and for which 5 = Q{n~ l l 2 ). 

3. There is a balanced monotone Boolean function with an algorithm that always correctly 
evaluates it on any input and for which 5 = ©(n" 1 / 3 logn). 

4- There is a balanced monotone Boolean function with an algorithm that correctly eval- 
uates it on most inputs most of the time and for which 5 = G(n -1 / 3 ). 

The constructions of Theorem ^ are optimal except possibly for the factors of y/\ogn and 
logn, as the lower bounds that we prove in Sections 01 and 0] show: 

Theorem 2. 1. If an algorithm correctly computes a balanced Boolean function f : { — 1,1} 
{—1, 1} on most inputs most of the time, then 5 > 0(n -1 / 2 ). More generally and more 
precisely, Pr[algorithm is wrong] > ~ Var(/) — \n5 2 , holds regardless if f is balanced. 

2. If an algorithm correctly computes a balanced monotone Boolean function on all inputs 
all of the time, then 5 > 0(n~ 1//3 ). More generally and more precisely, if f is monotone 
but not necessarily balanced, then Var(/) < 8 3 ' 2 n 1 ' 2 . 



2 Constructions 



The constructions are based on directed percolation on certain graphs, and were inspired 
part by coupling-from-the-past on generic Markov chains |PW98] and in part by Rad- 



ford Neal's circular coupling |Nea02j . though background on these topics is not required to 
understand the constructions. 
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2.1 Directed percolation on the wrapped extended butterfly 

The examples we construct are based on a directed graph. There are various different choices 
that might work here, and the framework we have chosen, the wrapped extended butterfly, 
offers an explicit description and reasonably clear proofs. 

Define the wrapped extended butterfly Qh,w to be a directed graph with H x W vertices, 
where H is even and often a power of 2. The coordinates of a vertex are (h, t) where h and 
t are integers with < h < H and < t < W. Each vertex has in-degree two and 
out-degree two. The vertex (h, t) has directed edges leading to (2h mod H, t + 1 mod W) 
and (2h + 1 mod H, t + 1 mod W). When H = 2 d and W = d, the undirected unwrapped 
version of this graph is sometimes known as the "omega network" or "shuffle network" 
|Lei92t Chapter 3.8.1], which is isomorphic to the usual butterfly |Lei92|. Chapter 3.2.1] that 
is sometimes used in parallel computing architectures. The extended butterfly (when H = 2 d 
and W = 5d) has also been used in the construction of holographic proof systems JPS94J. 
We will for convenience take H = 2 d } but for our constructions W will be much larger, 
W = Q(d2 d ) for one construction and W = Q[2 d / 2 ) for the other. There are many graphs 
that we could have used in our constructions, but with the wrapped extended butterfly, the 
calculations come out nicely. 

We will refer to the set of vertices with last coordinate t as the "t th time slice" (where 
"time" is periodic), and the vertices with the same first coordinate as "points". 

We will consider two ensembles of random subgraphs of the wrapped extended butter- 
fly. In the non-monotone ensemble, which we use for the non-monotone construction, each 
vertex has exactly one of its two out-going edges included in the subgraph, and an input bit 
determines the least significant bit (i.e., the parity) of the first coordinate of the destination 
vertex. We think of the n = HW input bits as being associated with the vertices. In the 
monotone ensemble, which we use for the monotone construction, both possible out-going 
edges from a vertex have an input bit to determine whether or not the edge is present in the 
subgraph. Here the expected out-degree of each vertex is 1. A total of n = 2HW input bits 
(associated with the edges) are used. 

In the various constructions we will be interested in the directed cycles that may exist 
in this random subgraph defined by the input bits. A Las Vegas algorithm may determine 
the cycles that exist in the random subgraph by picking a uniformly random time slice and 
following all the paths forward in time until the starting time slice is reached. No additional 
bits of the input need to be read. Many of the paths merge early on, so following them all 
does not require as many reads as it might at first appear. As we shall see later, a Monte 
Carlo algorithm that works most of the time for most inputs may instead follow the paths 
starting from a smaller random set of vertices. 

2.2 The nonmonotone random subgraph ensemble 

Since the subgraphs in the nonmonotone ensemble have out-degree 1 at each vertex, there 
must be at least one directed cycle. The Boolean function can be any sort of balanced 
function of these cycles. For example, we could take the lexicographically smallest vertex 
that is in a cycle and in time slice 0, and take its bit. 

Lemma 3. Provided H = 2 d and W > d, this Boolean function is exactly balanced. 
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Proof. Because the wrapped extended butterfly is symmetric under the operation that flips 
the interpretation of the bits at a given time slice, a random subgraph is isomorphic to the 
subgraph obtained by flipping the bits at a time slice and suitably permuting the bits at the 
next d time slices. □ 



Remark: If we wish the Boolean function to be symmetric in its input bits while remaining 
exactly balanced, we can do this via a trick that places four input bits bhj,o, ■ ■ ■ , bh,t,3 & t each 
vertex (h, t). The parity (6/^,0 ©• • •© 6/1,4,3) of these four bits determines which outgoing edge 
from vertex (h, t) is used. These four bits also determine another bit b' h t that is symmetric 
in the four bits and independent of parity: this bit b' h t is 1 iff there are exactly one or two 
ones among (6/i,t,o, &/i,t,i, 6/i,t,2, 6/1,4,3) and these are consecutive in the cyclic order. We can 
take our Boolean function / to be the XOR of these b' h t bits at all vertices that lie in a cycle. 
For any H = 2 d and W > d, the Boolean function is symmetric in the input bits and exactly 
balanced. 



Lemma 4. In the nonmonotone ensemble, the probability that the Las Vegas algorithm reads 
a bit is at most C^IU^ 1 log H + H' 1 ) . 

Proof. Let p t be the number of vertices whose bits are read in a time slice t time units 
after the initial time slice. If we run the p particles independently forward log 2 H time steps 
from these locations, then their resulting locations would be independent and uniformly 
random. Under this independent random walks dynamics, when two particles reach a vertex, 
their outgoing edges may be different. If instead we consider coalescing dynamics, each 
vertex has one outgoing edge (and these outgoing edges are chosen uniformly independently), 
and all the particles arriving at the vertex always use that same edge. We may couple 
the coalescing dynamics with the independent random walks dynamics in such a way that 
the occupied locations at the end of the walks in the coalescing dynamics is a subset of 
the locations in the independent random walks dynamics. Suppose that two independent 
uniformly random particles in the same time slice do non-coalescing independent random 
walks on the wrapped extended butterfly for s steps each, where s < W, and let N be the 
number of collisions, that is, vertices visited by both. Then E[7V] = s/H. Moveover, given 
that the particles visited the same vertex, the probability that they will be at the same vertex 
t time slices later is max{2~', 1/H}. Consequently, E[iV|iV > 0] < 2 + s/H, and therefore 
Pr[iV > 0] = E[N]/E[N\N > 0] > s/(2H + s). Regardless of what the two particles do, 
the probability that a third particle (started from an independent and uniformly random 
location in their initial time slice) collides with at least one of them is at most 2s/ H. Thus 
the probability that a given pair of particles collide but that neither one collides with another 
particle is at least 

s 



2H + s 



pt 



Under the coalescing dynamics, the probability of this event can only be larger. The number 
of such events is at most the reduction in the number of particles, so we find 

pt 

E[p t+ i og2jff+s |p t ] <p t - 







\2J2H + s 
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Taking s ~ H/pt, we see that the expected reduction (that is, K[p t — p t+ \o g2 H+s\pt\) is Q(pt)- 
Let Tk be the number of time slices for which 2 k < p t < 2 fc+1 . From the above, we see 
that E[T fc ] = 0(F/2 fc + logtf). Thus £ fc 2 fc+1 E[T fc ] < 0(H log H). After the paths have all 
coalesced, there is one path that continues for 0(W) steps. Since the expected number of 
input bits read is 0(HlogH + W), and each of the HW input bits is read with the same 
probability, we obtain the desired bound on 5. □ 

The optimal choice is H = 6(yn/ logn) and W = 0(Vnlogn), giving S = O {n^ 1 / 2 ^/\ogn) , 
which gives part 1 of Theorem 1. 

To obtain part 2 of Theorem 1 we instead take W = H . The Las Vegas algorithm would 
have 5 = 0(ra _1//2 logn), but we can save the factor of logn by settling for an algorithm that 
reports the correct answer most of the time on most inputs. Rather than locating the set 
of cycles by trying all possible starting points at a random time slice and following them 
forward W steps, we instead pick only m points from the uniformly selected time slice t , 
and follow them forward until they cycle. Here, m is large, but depends only on the required 
reliability of the algorithm. Given to, the m points are selected at random, uniformly and 
independently. 

Lemma 5. Assuming W = H , the probability that this algorithm reads any specific bit is 
0(m/H) 

Proof. We prove this in the case m = 1, which is clearly sufficient. Moreover, by symmetry, 
it is enough to show that the expected number of vertices visited is 0(W). We start with a 
single particle, and follow its path. Let X be the number of vertices visited. If the path did 
not enter its previously visited vertices after s steps, where so > W, then the conditioned 
probability that it will hit itself in the next s + log 2 H steps is at least s/(2H + s), by 
the argument giving the lower bound for Pr[iV > 0] in the proof of Lemma 0J Taking 

s = H - log 2 H, say, we find that Pr[X > (j + 1)W\X > jW] < 3/4 for every j = 1, 2, 

Consequently, E[X] = 0(W). □ 

To complete the argument, we need to show that with probability tending to 1 as m — > oo 
(uniformly in H), the algorithm finds all open cycles. This follows from the following lemma. 

Lemma 6. Assuming W = H, the expected number of open cycles that are undiscovered by 
the algorithm is 0(l/(m+ 1)). 

Proof. Let {vq,Vi, . . . ,v m } be a set of m + 1 uniformly chosen vertices in the to time slice, 
which are chosen independently given to- Let F be the union of the paths of length W/2 
starting at {v\, V2, ■ ■ ■ , v m }. We are going bound the probability for the event A that vq is 
on an open cycle that does not intersect T. Let 7 be the path of length W/2 starting at v . 

If we were to follow all H particles at time slice to forward in time, let Tj be the number 
of time steps before there are at most j particles. Because Prfr^ > [2E[r,]]] < 1/2 and 
successive blocks of [2E[r 7 ]] time slices are independent, Pr[rj > £[2E[rj]]] < 1/2^. Provided 
j is not too large (j < H/(logH log log iJ)), our bound for E[Tfc] in Lemma |U implies that 
E[Tj] = 0(H/j), so we can take £ = Q(j) to find that Pr[rj > H/2] decays geometrically in 
3- 
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Let Y be the number of particles starting at {vq, ■ ■ ■ , v m } that have not merged with any 
other particle after W/2 steps; we have K[Y] < 0(1). By symmetry Pr[7 fl T = 0|Y] = 
Y/(m+l), so in fact Prpyfir = 0] < 0(l/(m + l)). Given 7 and T, the probability that the 
continuation of 7 hits vq on first return to the to time slice is exactly 1/H. Moreover, the 
same argument as in LemmaEl shows that given 7 and V the probability that the continuation 
of 7 hits vq at the j th return to time slice to (but not sooner) is bounded by 0((3/4)- ? /H). 
Thus, we have Pr[*4] < 0(l/((m+ 1)H)). Since the number of open cycles that are disjoint 
from T is at most the number of vertices in the to time slice that are on such cycles, this 
proves the lemma. □ 



2.3 The monotone random subgraph ensemble 

Since the subgraphs in the monotone random subgraph ensemble have random out-degree, 
we are not assured that there is a directed cycle, and when W 3> \[Tl it is even unlikely 
for there to be a directed cycle. This suggests that we take our Boolean function to be the 
existence of a directed cycle, or what turns out to be a little simpler to analyze, the existence 
of a directed cycle of length W. 

Lemma 7. In the monotone ensemble, the probability that the Las Vegas reads a bit is at 
most (4 + o(l))VT~ 1 logW / . 

Proof. Let v be the starting vertex of the edge assigned to the given input bit, and suppose 
that the initial time slice was t steps before v. Let G v be the subgraph of the wrapped 
extended butterfly which consists of all directed paths of length at most W terminating at v. 
Each edge of G v occurs in the random subgraph with probability 1/2. There is an obvious 
graph covering of G v by the binary tree of depth W . An upper bound for the probability 
that v is explored conditioned on t is therefore the probability that the root of the binary 
tree is connected to some vertex at level t in a subgraph where each edge is included with 
probability 1/2 independently. (See, e.g., {££11] or |BS96[ Theorem 1].) Let S t be this 
latter probability. We have S t+ i — S t — jSf, so for large t, S t ~ 4/t. The probability that a 
vertex is explored is upper bounded by 

^E^(4 + o(l))^. 
t=o 

In the monotone ensemble, let us consider the cycles that wind around exactly once 
(cycles with length W). Assuming W > d, these cycles are in bijective correspondence with 
bit strings of length W: given a cycle, the bit string is formed by the parities of the vertical 
coordinates, and given a bit string, the vertical coordinates are determined by groups of d 
consecutive bits (in circular order). Let N HjW be the number of such cycles that occur in 
the graph. Since there are 2 W possible such cycles, each occuring with probablity 2~ w , we 
have E[N H ,w] = 1- 

Remark: As we shall see, the interesting regime is when W = Q(y/~H). It is not hard to 
show lim 1<w<v ^Pr[Af#,w > 0] = 1 — 1/e and lim 1<v ^- <iy Pr[N HjW > 0] = 0. Presumably 
linifl^oo P?[N H cy T£[ > 0] is a continuous function of c, in which case there is some particular 
positive value of c that we can pick so that Pr[iV Hcv ^ > 0] = 1/2, so that our Boolean 
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function is nearly balanced. liniH_>oo Pi[N Hc ^jj > 0] is an interesting function of c, but not 
one that we shall explore here. For our purposes it suffices to use a second moment estimate. 

To compute E[iV^ w ], we are interested in pairs of circular bit strings. Some pairs of 
cycles share edges, and for these pairs the probability that both cycles occur will be larger 
than 2~ 2W . Define a merge time to be a time slice where the two cycles coincide on the 
outgoing edge but not the incoming edge, and a split time to be a time slice where the 
two cycles coincide on the incoming edge but not the outgoing edge. The splits and merges 
alternate, and after a split, the next merge cannot occur within the next d time slices, but 
there are no other constraints on when the splits and merges may occur. Suppose there are 
£ splits and £ merges — these occur at distinct times, so there are at most 2(^) ways to 
select the times during which the two cycles share an edge. (When £ = 0, there is one way 
to select the merge and split times, but still two ways to decide which edges agree between 
the cycles.) Given the times during which the cycles share edges, how many pairs of cycles 
satisfy these constraints? There are 2 W ways to select the first cycle, each occuring with 
probability 2~ w . Every merge specifies the preceding d+ 1 bits in the second cycle, and the 
bit for every shared edge as well as every edge at a split time is also specified. The number 
of ways to pick the second string is at most 2 W ~ S / (2H) e , where S is the number of shared 
edges between the two cycles. The probability that the second cycle occurs given that the 
first one does is 2~ w+s . The expected number of pairs of cycles with £ splits and £ merges 
is at most 

W 2e 1 
(2£)\ (2H) e " 

We may sum over all £ to conclude E[Njj W ] < exp(W/V2H) + exp(-W/V2H). Since 
Pt[N H:W > 0] > E[N HiW ] 2 /E[N 2 HiW ], we have 

Pt[N h , w > 0] > 



exp(W/V2H) + exp(-W/V2H) 



We can pick that value of c for which l/(e c + e~ c ) = 1 — \j\[2 (c = 1.12838) and set W = 
[c\ / 2H\ . Instead of asking about the existence of a cycle of length W, we instead ask if there 
is a cycle that passes through a suitable set of vertices at the first time slice. The probability 
that a particular vertex is part of a cycle is at most l/H, the probability that a vertex is part 
of a cycle while the lexicographically smaller ones aren't will be even smaller, so by adjusting 
the size of the set of vertices we can ensure 1 — 1/ \/2 < Pr[suitable cycle exists] < 1 — 1/ \[2 + 
l/H. If a cycle passes through the last node in the suitable set, then call it a marginally 
suitable cycle, otherwise call it completely suitable: Pr[completely suitable cycle exists] < 
1 - l/y/2. We may repeat this experiment twice (with new bits for the second experiment) 
while taking H = 0(n 2 / 3 ) and W = [c\ / 2H\ in both experiments. If there is a completely 
suitable cycle in either experiment, the Boolean function takes the value 1, if there is no 
suitable cycle in either experiment, the Boolean function takes the value 0. In the third 
scenario, where there is a marginally suitable cycle but not a completely suitable cycle, 
we may tweak the definition of the Boolean function so that it is monotone and exactly 
balanced. This third scenario occurs with probability 0(n~ 2//3 ), so it can increase 5 by at 
most 0(n~ 2 / 3 ), so we still have 5 < 0(n~ 1//3 logn), giving us part 3 of Theorem 1. 
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For part 4 of Theorem 1, we can use the same Boolean function that we used for part 3, 
and use the same choice of W and H. As before, we can save the factor of logn by choosing 
a typically smaller collection of starting points. Let m be a large constant that depends only 
on the required reliability of the algorithm. The algorithm chooses a random set S of vertices 
in the extended butterfly, where each vertex is in S with probability m/H, independently. 
The algorithm then explores all the open paths of length at most 2W starting at the vertices 
in S. 

As above, 5 can be bounded by a corresponding process on the binary tree of depth 2 W. 
In this case, 5 is bounded by the expected number of "selected" vertices in the percolation 
component of the root, where each vertex is selected with probability m/H independently 
from other vertices and from the percolation process. Since the expected number of vertices 
in the percolation cluster of the root and at distance s from the root is 1 (when s < 2 W) , 
we get 5 < 2Wm/H = 0(mn -1 / 3 ). 

The proof of the theorem is completed by showing that this algorithm finds all open 
cycles of length W with high probability. This follows from the following lemma. 

Lemma 8. Assuming y/m < W = Q(\/H~), the expected number of open cycles of length W 
unexplored by this algorithm is at most c~^™ for some constant c G (0, 1). 

Proof. Fix some cycle 7 of length W. Let A be the event that 7 is open and undetected 
by the algorithm. Set Wq = [W/ \/m\ , and let 70 be a subpath of 7 with Wo vertices. Let 
N be the number of open paths of length at most Wq that start at a vertex in S, end at a 
vertex in 70, and are otherwise disjoint from 7. We will now prove that Pr[A > 0] = 6(1) 
by a second moment argument. 

Fix some k e {0, 1, 2, ... , Wo}. The number of paths of length k that end at a vertex in 
7o but are otherwise disjoint from 7 is O(W 2 k ). Thus 



The estimation of the second moment is similar to the one done above for Nh,w- We can 
bound E[A 2 ] by considering separately the four cases where the pair of paths have the same 
or different starting points and the same or different ending points. We enumerate the pairs 
of paths by tracing them backwards from 70. Let t be the number of splits (merges in 
reverse), and let k\ and k 2 be the lengths of the two paths. We find E[A 2 ] is at most 



E[JV] = Q ( W ° 2fc ) 2 ~ k 7? = ( 1 ) • 



fc=0 



different start, different end different start, same end 



fci=0 



W 2 kl 2 




+ 



and thus Pr[A > 0] = 0(1). 
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We may conclude that for every k — 0,1, ... , [y/m/2\ — 1 the probability that 7 is hit in 
a time slice t E [2 k Wo, (2 k + 1) Wq) by an open path of length at most Wq starting in S is 
0(1). Since these events are independent, the probability that 7 is not visited by an open 
path of length at most Wq starting in S is c _v/ ™ for some constant c E (0, 1). The proof is 
complete by noting that there are 2 W possible cycles 7, and each is open with probability 
2~ w . □ 



3 Lower bound for balanced Boolean functions 

Suppose that a randomized algorithm approximately computes a Boolean function. Let 
A{r, z) denote the output of the algorithm when it uses coins r = r\r-i ■ ■ ■ on input z = 
ziz-2 - • • z n . As usual we let 5i = Pr[bit i gets read]. Consider two independent runs of the 
algorithm (i.e., using independent coins r = r±r 2 • • • and s — SiS 2 • • • ) on independent inputs 
x = x±x 2 ■ • ■ x n and y = yiy 2 • • 'Vn- Let iV be the number of bit positions that are read by 
both of these independent runs: 

Pr[N > 0] < E[N] =J2 5 i< n $ 2 - 

i 

We define z — Z]_z 2 • • ' Zn by 

{Xi if algorithm with coins r and input x reads bit % 
Hi otherwise. 

The vector z is uniformly random and independent of r and s. Of course A(r,z) = A{r,x). 
In the event that N = 0, we have A(s,z) = A(s,y). Since A(r,x) and A(s,y) are i.i.d., 

Pv[A(r, z) = A{s, z)] < Pv[A(r, x) = A{s, y)\ + Pr[N > 0] 

< Pr[A(r, x) = -l] 2 + Pv[A(r, x) = l] 2 + n5 2 . 

Set w = Pv[A(r, x) ^ f(x)}, p = Pr[f(x) = 1] and p' = Pv[A(r, x) = 1]. If A(r, z) ^ A(s, z), 
then either A(r, z) ^ f(z) or A(s,z) ^ f(z). Thus, 

w> \{l-p' 2 -{l-p'f-n5 2 ) =p'-p' 2 -l n 5 2 . 

Now note that the absolute value of the p-derivative of Var(/)/4 = p — p 2 is bounded by 1. 
Thus, p' — p' 2 >\ Var(/) — \p — p'\ > \ Var(/) — w. Consequently, 

2w>-^U)- l -n5 2 , 

as required. 
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4 Lower bound for balanced monotone Boolean func- 
tions 

Suppose that / : {— l,l} n — ► 1R is some function, and A is a randomized algorithm calculating 
/ (exactly and always). Recall the definition of the Fourier coefficients 

f(S):=E[f(x)Y[x % ], Sc{l,2,...,n}. 

When / is monotone and takes values in {—1,1}, we clearly have 

h(f) = /({<}) • 

We will need the inequality 

n 

£/({*}) <V^. (1) 

i=l 

This inequality holds for / taking values in {—1, 1}, even if / is not monotone. It is ob- 
tained by combining the first two displayed inequalities in the proof of Theorem 1 in [OS04 
(although the algorithms discussed there are deterministic, the proof applies to random 
algorithms as well). Alternatively, the case k = 1 of the inequality 

£/(<f?) 2 <^ II/IH (2) 

\S\=k 



2 n 



from |SS04j gives 

i=l i=l 

implying (Although (J2J) does not assume that / is monotone or boolean, in the last step 
we assumed that / takes values in {—1, 1} to drop the factor ||/|||.) 

Another inequality that we need to quote (valid for boolean but not necessarily monotone 
/)is 

n 

Var(/)<E*W)- (3) 

i=l 

See 



Now assume that / : {—1,1}" — > {—1,1} is monotone. Since h(f) = /({*})> inequali- 
ties © and (JTJ) give 

which proves part 2 of Theorem |21 
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